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^2 (57) Abstract: The present invention relates to protein-protein interactions in adipocytes. More specifically, the present invention 
relates to complexes of polypeptides or polynucleotides encoding the polypeptides, fragments of the polypeptides, antiodies to the 
complexes. Selected Interacting Domains (SID®) which are identified due to the protein-protein interactions, methods for screening 

^ drugs for agents which modulate the interaction of proteins and pharmaceutical compositions that are capable of modulating die 

^ protein-protein interactions. 
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MORE AND MORE PROTEIN-PROTEIN INTERACTIONS 
IN ADIPOCYTE CELLS 

5 . 

FIELD OF THE INVENTION 

10. The present invention relates to proteins that interact with adipocytes. More 

specifically, the present invention relates to complexes of polypeptides or 
polynucleotides encoding the polypeptides, fragments of the polypeptides, antibodies to 
the complexes, Selected Interacting Domains (SID®) which are identified due to the 
protein-protein interactions, methods for screening drugs for agents which modulate the 

15 interaction of proteins and pharmaceutical compositions that are capable of modulating 
the protein-protein interactions. 

In another embodiment the present invention provides a protein-protein interaction 
map called a PIM® which is available in a report relating to the protein-protein 
interactions of adipocytes. 

20 In yet another embodiment the present invention relates to the identification of 

additional proteins in the pathway common to the proteins described therein, such as 
metabolic pathways. 

BACKGROUND 

25 Most biological processes involve specific protein-protein interactions. 

Protein-protein interactions enable two or more proteins to associate. A large 
number of non-covalent bonds form between the proteins when two protein surfaces 
are precisely matched. These bonds account for the specificity of recognition. Thus, 
protein-protein interactions are involved, for example, in the assembly of enzyme 

30 subunits, in antibody-antigen recognition, in the formation of biochemical complexes, 
in the correct folding of proteins, in the metabolism of proteins, in the transport of 
proteins, in the localization of proteins, in protein turnover, in first translation 
modifications, in the core structures of viruses and in signal transduction. 

General methodologies to identify interacting proteins or to study these 

35 interactions have been developed. Among these methods are the two-hybrid system 
originally developed by Fields and co-woricers and described, for example, in U.S. 
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Patent Nos. 5.283,173, 5,468,614 and 5,667,973, which are hereby incorporated by 
reference. 

The earliest and simplest two-hybrid system, which acted as basis for 
development of other versions, is an in vivo assay between two speciflcaily 

5 constructed proteins. The first protein, l<nown in the art as the "bait protein" is a 
chimeric protein which binds to a site on DNA upstream of a reporter gene by means 
of a DNA-binding domain or BD. Commonly, the binding domain is the DNA-binding 
domain from either Gal4 or native E. coli LexA and the sites placed upstream of the 
reporter are Gal4 binding sites or LexA operators, respectively. 

10 The second protein is also a chimeric protein known as the "prey" in the art. This 

second chimeric protein cames an activation domain or AD. This activation domain 
is typically derived from Gal4, from VP16 or from B42. 

Besides the two hybrid systems, other Improved systems have been developed 
to detected protein-protein interactions. For example, a two-hybrid plus one system 

15 was developed that allows the use of two proteins as halt to screen available cDNA 
libraries to detect a third partner. This method pemilts the detection between 
proteins that are part of a larger protein complex such as the RNA polymerase II 
holoenzyme and the TFIIH or TFIID complexes. Therefore, this method, in general, 
penults the detection of ternary complex fomiation as well as Inhibitors preventing 

20 the interaction between the two previously defined fused proteins. 

Another advantage of the two-hybrid plus one syistem is that It allows or prevents 
the fbnnation of the transcriptional activator since the third partner can be expressed 
from a conditional promoter such as the methionine-repressed Met25 promoter which 
is positively regulated In medium lacking methionine. The presence of the 

25 methionine-regulated promoter provides an excellent control to evaluate the 
activation or inhibition properties of the third partner due to Its "on" and "off" switch for 
the fonnation of the transcriptional activator. The three-hybrid method is described, 
for example in Tirode et al., Tlie Journal of Biological Chemistry, 272, No. 37 pp. 
22995-22999 (1997) incorporated herein by reference. 

30 Besides the two and two-hybrid plus one systems, yet another variant is that 

described in Vidal et al, Proc. Natl. Sci. 93 pgs. 10315-10320 called the reverse two- 
and one-hybrid systems where a collection of molecules can be screened that inhibit 
a specific protein-protein or protein-DNA Interactions, respectively. 

2 
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A summary of the available methodologies for detecting protein-protein 
interactions is described In Vidal and Legrain, Nucleic Acids Researcti Vol, 27. No. 4 
pgs. 919-929 (1999) and Legrain and Selig. FEBS Letters 480 pgs. 32-36 (2000) 
which references are incorporated herein by reference. 

However, the above conventionally used approaches and especially the 
commonly used two-hybrid methods have their drawbacks. For example, it is known 
in the art that, more often than not, false positives and false negatives exist in the 
screening method. In fact, a doctrine has been developed in this field for interpreting 
the results and in common practice an additional technique such as co- 
immunopirecipitation or gradient sedimentation of the putative interactors from the 
appropriate cell or tissue type are generally performed. The methods used for 
interpreting the results are described by Brent and Finiey. Jr. in Ann. Rev. Genet, 31 
pgs. 663-704 (1997). Thus, the data interpretation is very questionable using the 
conventional systems. 

One method to overcome the difficulties encountered with the methods in the 
prior art is described in W099/42612, incorporated herein by reference. This method 
is similar to the two-hybrid system described in the prior art in that it also uses bart 
and prey polypeptides. However, the difference with this method is that a step of 
mating at least one first haploid recombinant yeast cell containing the prey 
polypeptide to be assayed with a second haploid recombinant yeast cell containing 
the bait polynucleotide is perfonned. Of course the person skilled in the art would 
appreciate that either the first recombinant yeast cell or the second recombinant 
yeast cell also contains at least one detectable reporter gene that is activated by a 
polypeptide including a transcriptional activation domain. 

The method described in W099/42612 penmits the screening ofimore prey 
polynucleotides with a given bait polynucleotide in a single step than in the prior art 
systems due to the cell to cell mating strategy between haploid yeast cells. 
Furthemnore, this method is more thorough and reproducible, as well as sensitive. 
Thus, the presence of false negatives and/or false positives is extremely minimal as 
compared to the conventional prior art methods. 

The causes of non-insulin dependent diabetes mellitus (NIDDM) and obesity are 
often related to defects or problems with adipose tissue. Adipocytes play a critipal 
role in lipid storage and metabolism. Adipocytes also act as endocrine cells to 
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influence physiological parameters such as insulin sensitivity and body weight (Flier, 
et al., Cell, (1995) 80: 15-18). For example, the ob gene encodes leptin, an 
adipocyte secreted endocrine factor (Zhang, et al.. Nature (1994) 372: 425-432). 
Leptin has been shown to reduce body weight and blood glucose in obese, diabetic 
rodents (Pelley mounter, et al., Science, (1995) 269: 540-543). 

NIDDM is treated predominately with insulin. However, insulin is not convenient 
to use in that it must be injected 2-4 times per day and must be stored properly to 
prevent loss of efficacy. Other drugs used to treat NIDDM include troglitazone 
("Rezulin"). a PPARY agonist, Glucophage and sulfonylureas. Unfortunately, there 
are safety concerns related to the use of these drugs. The identification of safe, 
effective, orally available dnjgs for the treatment of NIDDM would greatly enhance 
the quality of life of patients who suffer from this disease. 

Several adipocyte-specific enzymes and receptors have been shown to be 
important targets for anti-obesity and anti-diabetic drug discovery. For example, 
agonists of the p3 adrenergic receptor, which is found predominantly in the adipose 
tissue in man (Amer, et al., New England Journal of Medicine, (1995) 333: 382-383), 
have anti-obesity and anti-diabetic properties In rodents and are currently in phase 
ll/lll trials in man. The thiazolldinedione class of compounds (TZDs), including 
troglitazone and ciglitazone, has been shown to Improve insulin sensitivity and 
thereby reduce hyperglycemia and hyperiipldemla conditions In rodents and In 
humans (Saltiel, et al.. Diabetes, (1996) 45: 1661-1669; Sreenan, et al.. American 
Journal Physiol, (1996) 271: E742-E747; Nolan, etal.. New England Journal of 
Medicine, (1994) 331: 1188-1193. Troglitazone C Rezulin") is approved for use In the 
U. S. and Japan. Many TZDs, including troglitaa)ne and ciglitazone, are potent 
activators of Peroxisome Proliferator Activated Receptor gamma (PPARy), a member 
of the nuclear receptor family of transcription factors (Tontonoz, etal.. Cell, (1994) 79: 
1147-1156; Lehmann, etal.. Journal of Biological Chemistry, (1995) 270: 12953- 
12955). PPARB, is a key regulator of adipocyte differentiation and is most abundant 
in adipose tissue. 

This shows that it is still needed to explore all mechanisms of adipocyte 
differentiation and to identify drug targets for metabolism diseases. 

The adipocytes (differentiated PAZ6 adipocytes) studied in the present invention 
are obtained by the method described in the PCT patent application WO96/34100. 
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SUMMARY OF THE INVENTION 

The present invention relates to identifying protein-protein interactions in 
adipocytes. 

The present invention also relates to identifying protein-protein interactions in 
adipocytes for the development of more effective and better targeted therapeutic 
applications. 

The present invention is also aimed at identifying complexes of polypeptides or 
polynucleotides encoding the polypeptides and fragments of the polypeptides of 
adipocytes. 

The present invention also relates to identifying antibodies to these complexes of 
polypeptides or polynucleotides encoding the polypeptides and fragments of the 
polypeptides of adipocytes including polyclonal, as well as monoclonal antibodies 
that are used for detection. 

The present invention also concerns the identification of selected interacting 
domains of the polypeptides, called SID® polypeptides. 

Furthermore, the present invention concems the identification of selected 
interacting domains of the polynucleotides, called SID® polynucleotides. 

Also, the present invention relates to generating protein-protein interaction maps 
called PIM®s. 

The present invention also provides a method for screening drugs for agents 
which modulate the interaction of proteins and pharmaceutical compositions that are 
capable of modulating the protein-protein interactions in adipocytes. 

The present Invention also relates to administering the nucleic acids of the 
present invention via gene therapy. 

Also, the present invention provides protein chips or protein microanays. 

In another embodiment, the present invention provides a report in, for example 
paper, electronic and/or digital forms, concerning the protein-protein interactions, the 
modulating compounds and the like as well as a PIM®. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a schematic representation of the pB1 plasmid. 
Fig. 2 is a schematic representation of the pB5 plasmid. 
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Fig. 3 is a scliematic representation of the pB6 piasmid. 

Fig. 4 is a schematic representation of the pB13 piasmid. 

Fig. 5 is a schematic representation of the pB14 piasmid. 

Fig. 6 is a schematic representation of the pB20 piasmid. 

Fig. 7 is a schematic representation of the pP1 piasmid. 

Fig. 8 is a schematic representation of the pP2 piasmid. 

Fig. 9 is a schematic representation of the pP3 piasmid. 

Fig. 10 is a schematic representation of the pP6 piasmid. 

Fig. 11 is a schematic representation of the pP7 piasmid. 

Fig. 12 is a schematic representation of vectors expressing the T25 fragment. 

Fig. 13 is a schematic representation of vectors expressing the T18 fragment. 

Fig. 14 Is a schematic representation of various vectors of pCmAHLI. pT25 and 

pT18. 

Fig. 15 Is a schematic representation identifying the SID®'s of adipocytes. In this 
figure the "Full-length prey protein" Is the Open Reading Frame (ORF) or coding 
sequence (CDS) where the identified prey polypeptides are included. The Selected 
Interaction Domain (SID®) Is detemilned by the commonly shared polypeptide 
domain of every selected prey fragment 
Fig. 16 is a protein map (PIM®). 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

As used herein the terms "polynucleotides", "nucleic acids" and 
"oligonucleotides" are used Interchangeably and include, but are not limited to RNA, 
DNA, RNA/DNA sequences of more than one nucleotide in either single chain or 
duplex fomi. The polynucleotide sequences of the present invention may be 
prepared from any known method including, but not limited to, any synthetic method, 
any recombinant method, any ex vivo generation method and the like, as well as 
combinations thereof. 

Polynucleotides which can hybridize to any of the polynucleotides discussed 
above are also covered by the present invention. Such polynucleotides are refen^ed to 
herein as "hybridizing polynucleotides. Hybridizing polynucleotides can be useful as 
probes or primers, for example. 
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According to an embodiment of the present invention, sucli liybridizing molecules 
are at least 10 nucleotides in length. According to another embodiment, such hybridizing 
molecules are at least 25 or at least 50 nucleotides in length. 

In an embodiment, the hybridizing molecules will hybridize to any of the 

5 polynucleotides of the present invention under stringent hybridization conditions. One 
example of stringent hybridization conditions is where attempted hybridization is earned 
out at a temperature of from about 35°C to about 65°C using a salt solution which is about 
0.9 molar. However, the skilled person will be able to vary such conditions as appropriate 
in order to take into account \'ariables such as probe length, base composition, type of ions 

10 present, etc. 

The term "polypeptide" means herein a polymer of amino acids having no 
specific length. Thus, peptides, oligopeptides and proteins are included in the 
definition of "polypeptide" and these terms are used Interchangeably throughout the 
specification, as well as in the claims. The term "polypeptide" does not exclude post- 
is translational modifications such as polypeptides having covalent attachment of 
glycosyl groups, aceteyl groups, phosphate groups, lipid groups and the like. Also 
encompassed by this definition of "polypeptide" are homologs thereof. 

By the temi "homologs" is meant structurally similar genes contained within a 
given species, orthologs are functionally equivalent genes from a given species or 
20 strain, as determined for example, in a standard complementation assay. Thus, a 
polypeptide of interest can be used not only as a model for identifying similiar genes 
in given strains, but also to Identity homologs and orthologs of the polypeptide of 
interest in other species. The orthologs, for example, can also be identified in a 
conventional complementation assay. In addition or alternatively, such orthologs can 
25 be expected to exist in bacteria (or other kind of cells) in the same branch of the 
phylogenic tree, as set forth, for example, at ftp://fte.cme.insu.edu/pu b/rdp/SSU- 
rRNA/SSU/Prok.phvlo . 

As used herein the term "prey polynucleotide" means a chimeric polynucleotide 
encoding a polypeptide comprising (i) a specific domain; and (ii) a polypeptide that is 
30 to be tested for interaction with a bait polypeptide. The specific domain is preferably 
a transcriptional activating domain. 

As used herein, a "bait polynucleotide" is a chimeric polynucleotide encoding a 
chimeric polypeptide comprising (i) a complementary domain; and (ii) a polypeptide 

7 



wo 02/086122 



PCT/EP02/03768 



that is to be tested for InteracBoh with at least one prey polypeptide. The 
complementary domain is preferably a DNA-binding domain that recognizes a 
binding site that Is further detected and is contained in the host organism. 

As used herein "complementary domain" is meant a functional constitution of the 
activity when bait and prey are interacting; for example, enzymatic activity. 

As used herein "specific domain" is meant a functional interacting activation 
domain that may work through different mechanisms by interacting directly or 
indirectly through intennediary proteins with RNA polymerase II or Ill-associated 
proteins in the vicinity of the transcription start site. 

As used herein the term "complementary" means that, for example, each base of 
a first polynucleotide is paired with the complementary base of a second 
polynucleotide whose orientation is reversed. The complementary bases are A and 
T (or A and U) or C and G. 

The term "sequence Identity" refers to the identity between two peptides or 
between two nucleic acids. Identity between sequences can be determined by 
comparing a position in each of the sequences which may be aligned for the 
purposes of comparison. When a position In ttie compared sequences is occupied 
by the same base or amino acid, then the sequences are Identical at tiiat position. A 
degree of sequence identity between nucleic acid sequences is a function of tiie 
number of identical nucleotides at positions shared by tiiese sequences. A degree of 
identity between amino acid sequences is a function of the number of Identical amino 
acid sequences tiiat are shared between these sequences. Since two polypeptides 
may each (i) comprise a sequence (i.e.. a portion of a complete polynucleotide 
sequence) that Is similar between two polynucleotides, and (ii) may furOier comprise 
a sequence that Is divergent between two polynucleotides, sequence identity 
comparisons between two or more polynucleotides over a "comparison window" 
refers to the conceptual segment of at least 20 contiguous nucleoti'de positions 
wherein a polynucleotide sequence may be compared to a reference nucleotide 
sequence of at least 20 contiguous nucleotides and wherein the portion of tiie 
polynucleotide sequence in the comparison window may comprise additions or 
deletions (i.e., gaps) of 20 percent or less compared to the reference sequence 
(which does not comprise additions or deletions) for optimal alignment of the two 
sequences. 



8 
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To determine the percent identity of two amino acids sequences or two nucleic 
acid sequences, tlie sequences are aligned for optimal comparison. For example, 
gaps can be introduced in the sequence of a first amino acid sequence or a first 
nucleic acid sequence for optimal alignment with the second amino acid sequence or 

5 second nucleic acid sequence. The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. 
When a position in the first sequence is occupied by the same amino acid residue or 
nucleotide as the corresponding position in the second sequence, the molecules are 
identical at that position. 

10 The percent identity' between the two sequences is a function of the number of 

identical positions shared by the sequences. Hence % identity = number of identical 
positions / total number of overlapping positions X 1 00. 

In this comparison the sequences can be the same length or can be different in 
length. Optimal alignment of sequences for determining a comparison window may 

15 be conducted by the local homology algorithm of Smith and Watemrian (J. Theor, 
BioL, 91 (2) pgs. 370-380 (1981), by the homology alignment algorithm of 
Needleman and Wunsch. J. MioL BioL, 48(3) pgs. 443-453 (1972). by the search for 
similarity via the method of Pearson and Lipman, PNAS, USA, 85(5) pgs. 2444-2448 
(1988). by computerized implementations of these algorithms (GAP. BESTFIT, 

20 FASTA and TFASTA in the Wisconsin Genetics Software Package Release 7.0, 
Genetic Computer Group, 575, Science Drive, Madison. Wisconsin) or by inspection. 
The best alignment (i.e., resulting in the highest percentage of identity oyer the 
comparison window) generated by the various methods is selected. 

The ternn "sequence identity" means that two polynucleotide or polypeptide 

25 sequences are identical (i.e., on a nucleotide by nucleotide or an amino add by 
amino acid basis) over the window of comparison. The term "percentage of 
sequence identity" is calculated by comparing two optimally aligned sequences over 
the window of comparison, detemiining the number of positions at which the Identical 
nucleic acid base (e.g., A, T, C, G, U. or I) occurs in both sequences to yield the 

30 number of matched positions, dividing the number of matched positions by the total 
number of positions in the window of comparison (i.e., the window size) and 
multiplying the result by 100 to yield the percentage of sequence Identity. The same 
process can be applied to polypeptide sequences. 



9 
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The percentage of sequence identity of a nucleic acid sequence or an amino acid 
sequence can also be calculated using BLAST software (Version 2.06 of Septennber 
1 998) with the default or user defined parameter. 

The term "sequence similarity" means that amino acids can be modified while 
retaining the same function. It is known that amino acids are classified according to 
the nature of their side groups and some amino acids such as the basic amino acids 
can be interchanged for one another while their basic function is maintained. 

The term "isolated" as used herein means that a biological material such as a 
nucleic acid or protein has been removed from its original environment in which it is 
naturally present. For example, a polynucleotide present in a plant, mammal or 
animal is present in its natural state and is not considered to be isolated. The same 
polynucleotide separated from the adjacent nucleic acid sequences in which it is 
naturally inserted in the genome of the plant or animal Is considered as being 
"isolated." 

The term "isolated" is not meant to exclude artificial or synthetic mixtures with 
other compounds, or the presence of impurities which do not interfere with the 
biological activity and which hiay be present, for example, due to incomplete 
purification, addition of stabilizers or mixtures with phamiaceutically acceptable 
excipients and the like. 

"Isolated polypeptide" or "isolated protein" as used herein means a polypeptide 
or protein which is substantially free of those compounds that are nonnally 
associated with the polypeptide or protein in a naturally state such as other proteins 
or polypeptides, nucleic acids, carbohydrates, lipids and the like. 

The term "purified" as used herein means at least one order of magnitude of 
purification is achieved, preferably two or three orders of magnitude, most preferably 
four or five orders of magnitude of purification of the starting material or of the 
natural material. Thus, the term "purified" as utilized herein does not mean that the 
material is 100% purified and thus excludes any other material. 

The term "variants" when refemng to, for example, polynucleotides encoding a 
polypeptide variant of a given reference polypeptide are polynucleotides that differ 
from tiie reference polypeptide but generally maintain their functional characteristics 
of the reference polypeptide. A variant of a polynucleotide may be a naturally 
occurring allelic variant or it may be a variant that is known naturally not to occur. 

10 
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Such non-naturally occurring variants of the reference polynucleotide can be made 
by, for example, mutagenesis techniques, including those mutagenesis techniques 
that are applied to polynucleotides, cells or organisms. 

Generally, differences are limited so that the nucleotide sequences of the 

5 reference and variant are closely similar overall and, in many regions identical. 

Variants of polynucleotides according to the present invention include, but are not 
limited to, nucleotide sequences which are at least 95% identical after alignment to 
the reference polynucleotide encoding the reference polypeptide. These variants can 
also have 96%, 97%, 98% and 99.999% sequence identity to the reference 

10 polynucleotide. 

Nucleotide changes present in a variant polynucleotide may be silent, which 
means that these changes do not alter the amino acid sequences encoded by the 
reference polynucleotide. 

Substitutions, additions and/or deletions can involve one or nrare nucleic acids. 

15 Alterations can produce conservative or non-conservative amino acid substitutions, 
deletions and/or additions. 

Variants of a prey or a SID® polypeptide encoded by a variant polynucleotide 
can possess a higher affinity of binding and/or a higher specificity of binding to its 
protein or polypeptide counterpart, against which it has been initially selected. In 

20 another context, variants can also loose their ability to bind to their protein or 
polypeptide counterpart. 

By "fragment of a polynucleotide" or "fragment of a SID® polynucleotide" is 
meant that fragments of these sequences have at least 12 consecutive nucleotides, 
or between 12 and 5,000 consecutive nucleotides, or between 12 and 10,000 

25 consecutive nucleotides, or between 12 and 20,000 consecutive nucleotides. 

By "fragment of a polypeptide" or "fragment of a SID® polypeptide" is meant that 
fragments of these sequences have at least 4 consecutive amino acids, or between 4 
and1,700 consecutive amino acids, or between 4 and 3,300 consecutive amino 
acids, or between 4 and 6,600 consecutive amino acids. 

30 By "anabolic pathway" is meant a reaction or series of reactions in a metabolic 

pathway that synthesize complex molecules from simpler ones, usually requiring the 
input of energy. An anabolic pathway is the opposite of a catabolic pathway. 
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As used herein, a "catabolic pathway" is a series of reactions in a metabolic 
pathway that break down complex compounds into simpler ones, usually releasing 
energy in the process. A catabolic pathway is the opposite of an anabolic pathway. 

As used herein, "drug metabolism" is meant the study of how drugs are 
processed and broken down by the body. Drug metabolism can involve the study of 
enzymes that break down drugs, the study of how different drugs interact within the 
body and how diet and other ingested compounds affect the way the body processes 
dnjgs. 

As used herein, "metabolism" means the sum of all of the enzyme-catalyzed 
reactions in living cells that transfomri organic molecules. 

By "secondary metabolism" is meant pathways producing specialized metabolic 
products that are not found in every cell. 

As used herein, "SID®" means a Selected Interacting Domain and is identified as 
follows: for each bait polypeptide screened, selected prey polypeptides are 
compared. Overiapping fragments in the same ORF or CDS define the selected 
interacting domain. 

As used herein the tenm "PIM®" means a protein-protein interaction map. This 
map is obtained from data acquired from a number of separate screens using 
different bait polypeptides and is designed to map out all of the Interactions between 
the polypeptides. 

The temi "affinity of binding", as used herein, can be defined as the affinity 
constant Ka when a given SID® polypeptide of the present invention which binds to a 
polypeptide and is the following mathematical relationship: 

[SID®/polypeptide complex] 
Ka = 

[free SID®] [free polypeptide] 
herein [free SID®], [free polypeptide] and [SID®/polypeptide complex] consist of the 
concentrations at equilibrium respectively of the free SID® polypeptide, of the free 
polypeptide onto which the SID® polypeptide binds and of the complex formed 
between SID® polypeptide and the polypeptide onto which said SID® polypeptide 
specifically binds. 

The affinity of a SID® polypeptide of the present invention or a variant thereof for 
its polypeptide counterpart can be assessed, for example, on a Biacore™ apparatus 

12 
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marketed by Amersham Pharmacia Biotech Company such as described by Szabo ef 
a/. {Curr Opin Struct Biol 5 pgs. 699-705 (1995)) and by Edwards and Leartherbanrow 
{Anal. Biochem 246 pgs. 1-6 (1997)). 

As used herein the phrase "at least the same affinity" with respect to the binding 
affinity between a SID® polypeptide of the present invention to another polypeptide 
means that the Ka is identical or can be at least two-fold, at least three-fold or at least 
five fold greater than the Ka value of reference. 

As used herein, the term "modulating compound" means a compound that 
inhibits or stimulates or can act on another protein which can inhibit or stimulate the 
protein-protein interaction of a complex of at least two polypeptides or the protein- 
protein interaction of at least two polypeptides. 

More specifically, the present invention comprises complexes of polypeptides or 
polynucleotides encoding the polypeptides composed of a bait polypeptide, or a bait 
polynucleotide encoding a bait polypeptide and .a prey polypeptide or a prey 
polynucleotide encoding a prey polypeptide. The prey polypeptide or prey 
polynucleotide encoding the prey polypeptide is capable of interacting with a bait 
polypeptide of interest in various hybrid systems. 

As described in the Background of the present invention, there are various 
methods known in the art to identify prey polypeptides that interact with bait 
polypeptides of interest. These methods include, but are not limited to, generic two- 
hybrid systems as described by Fields ef a/. {Nature, 340:245-246 (1989)) and more 
specifically in U.S. Patent Nos. 5.283,173, 5,468,614 and 5,667.973. which are 
hereby incorporated by reference; the reverse two-hybrid system described by Vidal 
ef al. {supra)] the two plus one hybrid method described, for example, in Tirode ef a/. 
{supra)', the yeast fonward and reverse 'n -hybrid systems as described in Vidal and 
Legrain (supra); the method described in WO 99/42612; those methods described in 
Legrain ef al. {FEBS Letters 480 pgs. 32-36 (2000)) and the like. 

The present invention is not limited to the type of method utilized to detect 
protein-protein interactions and therefore any method known in the art and variants 
thereof can be used. It is however better to use the method described in 
W099/42612 or WOOO/66722, both references incorporated herein by reference due 
to the methods' sensitivity, reproducibility and reliability. 
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Protein-protein interactions can also be detected using complementation assays 
such as tliose described by Pelletier et al. at 
httD://ww.abrf.Qrg/JBT/Articles/JBT0012/ibtQ012.html . WO 00/07038 and WO98/34120. 

Although the above methods are described for applications in the yeast system, the 
present invention is not limited to detecting protein-protein interactions using yeast, 
but also includes similar methods that can be used in detecting protein-protein 
interactions in, for example, mammalian systems as described, for example in 
Takacs et ai {Proc. NatL Acad, ScL, USA, 90 (21):10375-79 (1993)) and Vasavada 
et ai {Proa NatL Acad, Sc/., USA, 88 (23):10686-90 (1991)), as well as a bacterial 
two-hybrid system as described in Karimova et ai (1998), W099/28746, WO 
00/66722 and Legrain et ai {FEBS Letters, 480 pgs. 32-36 (2000)). 

The above-described methods are limited to the use of yeast, mammalian cells 
and Escherichia coli cells, the present invention is not limited in this manner. 
Consequently, mammalian and typically human cells, as well as bacterial, yeast, 
fungus, insect, nematode and plant cells are encompassed by the present invention 
and may be transfected by the nudeic acid or recombinant vector as defined herein. 
Examples of suitable ceils include, but are not limited to, VERO cells. HELA cells, 
such as ATCC No. CCL2. CHO cell lines such as ATCC No. CCL61, COS cells such 
as COS-7 cells and ATCC No. CRL 1650 cells, W138. BHK, HepG2, 3T3 such as 
ATCC No. CRL6361 , A549, PC12, K562 cells, 293 cells, Sf9 cells such as ATCC No, 
CRL1711 and Cv1 cells such as ATCC No. CCL70. 

Other suitable cells that can be used in the present invention include, but are not 
limited to, prokaryotic host cells strains such as Escherichia coli, (e.g., strain DH5-a), 
Bacillus subtilis. Salmonella typhimurium, or strains of the genera of Pseudomonas, 
Streptomyces and Staphylococcus. 

Further suitable cells that can be used in the present invention include yeast cells 
such as those of Saccharomyces such as Saccharomyces cerevisiae. 

The bait polynucleotide, as well as the prey polynucleotide can be prepared 
according to the methods known in the art such as those described above in the 
publications and patents reciting the known method perse. 

The bait and the prey polynucleotide of the present invention is obtained from 
adipocyte's cDNA (human differentiated PAZ6 adipocytes), or variants of cDNA 
fragment from a library of human differentiated PAZ6 adipocytes, and fragments from 
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the genome or transcriptome of human differentiated PAZ6 adipocytes ranging from 
about 12 to about 5,000, or about 12 to about 10,000 or from about 12 to about 
20,000. Tlie prey polynucleotide is then selected, sequenced and identified. 

A human differentiated PAZ6 adipocytes prey library is prepared from the human 
differentiated PAZ6 adipocytes and constructed in the specially designed prey vector 
pP6 as shown in Figure 10 after ligation of suitable linkers such that every cDNA 
insert is fused to a nucleotide sequence in the vector that encodes the transcription 
activation domain of a reporter gene. Any transcription activation domain can be 
used in the present invention. Examples include, but are not limited to, Gal4,YP16, 
B42, His and the like. Toxic reporter genes, such as CAT^, CYH2, CYH1. URA3, 
bacterial and fungi toxins and the like can be used in reverse two-hybrid systems. 

The polypeptides encoded by the nucleotide inserts of the human differentiated 
PAZ6 adipocytes prey library thus prepared are temried "prey polypeptides" in the 
context of the presently described selection method of the prey polynucleotides. 

The bait polynucleotides can be inserted in bait plasmid pB6 or pB5 as illustrated 
in Figures 3 and 2, respectively.. The bait polynucleotide insert is fused to a 
polynucleotide encoding the binding domain of, for example, the Gal4 DNA binding 
domain and the shuttle expression vector is used to transfonn cells. 

The bait polynucleotides used in the present invention are described in Table 1. 

As stated above, any cells can be utilized in transfonning the bait and prey 
polynucleotides of the present invention including mammalian cells, baclerial cells, 
yeast cells, insect cells and the like. 

In an embodiment, the present invention identifies protein-protein interactions in 
yeast. In using known methods a prey positive clone Is identified containing a vector 
which comprises a nucleic acid insert encoding a prey polypeptide which binds to a 
bait polypeptide of interest. The method in which protein-protein interactions are 
identified comprises the following steps: 

mating at least one first haploid recombinant yeast cell clone from a recombinant 

yeast cell clone library that has been transformed with a plasmid containing the 

prey polynucleotide to be assayed with a second haploid recombinant yeast cell 

clone transformed with a plasmid containing a bait polynucleotide encoding for 

the bait polypeptide; 

cultivating diploid cell clones obtained in step i) on a selective medium; and 
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selecting recx>mbinant cell clones which grow on the selective medium. 
This method may further comprise the step of: 

characterizing the prey polynucleotide contained in each recombinant cell clone 
which is selected in step iii). 
5 In yet another embodiment of the present invention, in lieu of yeast, Escherichia 

coli is used in a bacterial two-hybrid system, which encompasses a similar principle 
to that described above for yeast, but does not involve mating for characterizing the 
prey polynucleotide. ^ 

In yet another embodiment of the present invention, mammalian cells and a 
10 method similar to that described above for yeast for characterizing the prey 
polynucleotide are used. 

By perfonning the yeast, bacterial or mammalian two-hybrid system, it is possible 
to identify for one particular bait an interacting prey polypeptide. The prey 
polynucleotide that has been selected by testing the library of preys in a screen using 
15 the two-hybrid, two plus one hybrid methods and the like, encodes the polypeptide 
Interacting with the protein of interest. 

The present invention is also directed, in a general aspect, to a complex of 
polypeptides, polynucleotides encoding the polypeptides composed of a bait 
polypeptide or bait polynucleotide encoding the bait polypeptide and a prey 
20 polypeptide or prey polynucleotide encoding the prey polypeptide capable of 
Interacting with the bait polypeptide of interest. These complexes are identified in 
Table 2. 

In another aspect, the present invention relates to a complex of polynucleotides 
consisting of a first polynucleotide, or a fragment thereof, encoding a prey 
25 polypeptide that interacts with a bait polypeptide and a second polynucleotide or a 
fragment thereof. This fragment has at least 12 consecutive nucleotides, but can 
have between 12 and 5,000 consecutive nucleotides, or between 12 and 10,000 
consecutive nucleotides or between 12 and 20,000 consecutive nucleotides. 

The complexes of the two interacting polypeptides listed in Table 2 and the sets 
30 of two polynucleotides encoding these polypeptides also form part of the present 
invention. 

In yet another embodiment, the present invention relates to an isolated complex 
of at least two polypeptides encoded by two polynucleotides wherein said two 
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polypeptides are associated in the complex by affinity binding and are depicted in 
colunnns 1 and 4 of Table 2. 

In yet another embodiment, the present invention relates to an isolated complex 
comprising at least a polypeptide as described in column 1 of Table 2 and a 
polypeptide as described in column 4 of Table 2. The present invention is not limited 
to these polypeptide complexes alone but also includes the isolated complex of the 
two polypeptides in which fragments and/or homologous polypeptides exhibit at least 
95% sequence identity, as well as from 96% sequence identity to 99.999% sequence 
identity. 

Also encompassed in another embodiment of the present invention is an isolated 
complex in which the SID® of the prey polypeptides encoded by SEQ ID Nos. 34 to 
771 in Table 3 form the isolated complex. 

Besides the isolated complexes described above, nucleic acids coding for a 
Selected Interacting Domain (SID®) polypeptide or a variant thereof or any of the 
nucleic acids set forth in Table 3 can be inserted into an expression vector which 
contains the necessary elements for the transcription and translation of the inserted 
protein-coding sequence. Such transcription elements include a regulatory region 
and a promoter. Thus, the nucleic acid which may encode a marker compound of 
the present invention is operably linked to a promoter in the expression vector. The 
expression vector may also include a replication origin. 

A wide variety of host/expression vector combinations are employed in 
expressing the nucleic acids of the present invention. Useful expression vectors that 
can be used include, for example, segments of chromosomal, non-chromosomal and 
synthetic DNA sequences. Suitable vectors include, but are not limited to, 
derivatives of SV40 and pcDNA and known bacterial plasmids such as col El, pCR1, 
pBR322, pMal-C2, pET, pGEX as described by Smith et al [need cite 1988], pMB9 
and derivatives thereof, plasmids such as RP4^ phage DNAs such as the numerous 
derivatives of phage I such as NM989, as well as other phage DNA such as Ml 3 and 
filamentous single stranded phage DNA; yeast plasmids such as the 2 micron 
plasmid or derivatives of the 2m plasmid, as well as centomeric and integrative yeast 
shuttle vectors; vectors useful in eukaryotic cells such as vectors useful in insect or 
mammalian cells; vectors derived from combinations of plasmids and phage DNAs, 
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such as plasmids that have been modified to employ phage DNA or the expression 
control sequences; and the like. 

For example in a baculovirus expression system, both non-fusion transfer 
vectors, such as, but not limited to pVL941 (SamHI cloning site Summers, pVL1393 
(BamHI, Smal, Xba\, EcoR\, Not\, Xmalll, Bglll and Psfl cloning sites; Invitrogen) 
pVL1392 (Bglil, Psfl, A/ofl, Xmaill, EcoRI, Xba/I, Smal and SamHI cloning site; 
Summers and Invitrogen) and pBlueSaclll (BamHI, Bg/ll, Psfl, A/col and HindWl 
cloning site, with blue/white recombinant screening, Invitrogen), and fusion transfer 
vectors such as, but not limited to, pAc700(SamHI and Kpn\ cloning sites, in which 
the BamHI recognition site begins with the initiation codon; Summers), pAc701 and 
pAc70-2 (same as pAcTOO, with different reading frames), pAc360 (BamHI cloning 
site 36 base pairs downstream of a polyhedrin initiation codon; Invitrogen (195)) and 
pBlueBacHisA, B, C ( three different reading frames with BamHI, Bg/ll. Psfl. A/col and 
H/ndlll cloning site, an N-terminal peptide for ProBond purification and blue/white 
recombinant screening of plaques; Invitrogen (220) can be used. 

Mammalian expression vectors contemplated for use in the Invention include 
vectors with inducible promoters, such as the dihydrofolate reductase promoters, any 
expression vector with a DHFR expresston cassette or a DHFR/methotrexate co- 
amplification vector such as pED (Psfl, Safl, Sbal. Smal and EcoRI cloning sites, with 
the vector expressing both the cloned gene and DHFR; Kaufinan, 1991). 
Alternatively a glutamlne synthetase/methionine sulfoximine co-amplification vector, 
such as pEE14 (H/ndlll. Xfaa/I, Smal, Sbal, EcoRI and Bcfl cloning sites in which the 
vector expresses glutamlne synthetase and the cloned gene; Celltech). A vector that 
directs episomal expression under the control of the Epstein Ban- Virus (EBV) or 
nuclear antigen (EBNA) can be used such as pREP4 (BamHI, Sm, Xho\. A/ofl, Nhe\, 
H/ndlll, Nhe\, PvuW and Kpn\ cloning sites, constitutive RSV-LTR promoter, 
hygromycin selectable marker; Invitrogen) pCEP4 (BamHI, S/71, X/iol, A/ofl, Nhe\, 
H/ndlll, Nhe\, PvuW and Kpnl cloning sites, constitutive hCMV immediate early gene 
promoter, hygromycin selectable marker; Invitrogen), pMEP4 (Kpnl. Pvu\. Nhe\, 
H/ndlll, A/ofl, Xho\, Sfil, BamHI cloning sites, inducible methallothlonein 11a gene 
promoter, hygromycin selectable marker, Invitrogen), pREP8 (BamHI, Xho\, A/ofl, 
H/ndlll, Nhe\ and Kpn\ cloning sites, RSV-LTR promoter, histidinol selectable marken 
Invitrogen), pREP9 {Kpn\, Nhe\, H/ndlll, A/ofl, Xho\, SffI, BamHI cloning sites, RSV- 
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LTR promoter. G418 selectable marker; Invltrogen), and pEBVHis (RSV-LTR 
promoter, hygromycin selectable marker, N-terminal peptide purifiable via ProBond 
resin and cleaved by enterokinase; Invitrogen). 

Selectable mammalian expression vectors for use in the invention include, but 

5 are not limited to, pRc/CMV (H/nofill, SsfXI, Nofl. Sba\ and >Apal cloning sites, G418 
selection, Invitrogen), pRc/RSV (H/nofll, Spel, BsfXl, Not\, Xba\ cloning sites, G418 
selection, Invitrogen) and the like. Vaccinia virus mammalian expression vectors 
(see, for example Kaufman 1991 that can be used in the present invention include, 
but are not limited to, pSC11 (Smal cloning site. TK- and p-gal selection), pMJ601 

10 (Sa/1, Sma\, Afh, NaA, BspMW, BamHI. /\pal, Nhe\, Sacll, Kpn\ and HindlW cloning 
sites; TK- and p-gal selection), pTKgptFIS (EcoRI, Pst\, Sa/ll, Accl, HindW, Sbal 
BamH\ and Hpa cloning sites, TK or XPRT selection) and the like. 

Yeast expression systems that can also be used in the present Include, but are 
not limited to, the non-fusion pYES2 vector (Xtel, Sp^l. ShoU WofI, GsOCI, EcoRI. 

15 BsfKl BamHI, Sad, Kpnl and HindWl cloning sites, Invitrogen), the fusion pYESHisA, 
B. C (Xba/I, Sphl Shol Wofl. BsiXI, EcoRI. BamHI. Sad, Kpnl and HindlW doning 
sites, N-terminal peptide purified with ProBond resin and cleaved with enterokinase; 
Invitrogen), pRS vectors and the like. 

Consequently, mammalian and typically human cells, as well as bacterial, yeast, 

20 fungi, insect, nematode and plant cells an used in the present invention and may be 
transfected by the nucleic acid or recombinant vector as defined herein. 

Examples of suitable cells include, but are not limited to, VERO ceils, HELA cells 
such as ATCC No. CCL2, CHO cell lines such as ATCC No. CCL61. COS cells such 
as COS-7 cells and ATCC No. CRL 1650 cells. W138, BHK, HepG2, 3T3 such as 

25 ATCC No. CRL6361, A549, PCI 2. K562 cells, 293 cells, Sf9 cells such as ATCC No. 
CRL171 1 and Cv1 cells such as ATCC No. CCL70. 

Other suitable cells that can be used in the present invention include, but are not 
limited to, prokaryotic host cells strains such as Escherichia coli, (e.g., strain DH5-a). 
Bacillus subtilis. Salmonella typhimurium, or strains of the genera of Pseudomonas, 

30 Streptomyces and Staphylococcus. 

Further suitable cells that can be used in the present invention include yeast cells 
such as those of Saccharomyces such as Saccharomyces cerevisiae. 
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Besides the specific isolated complexes, as described above, the present 
invention relates to and also encompasses SID® polynucleotides. As explained 
above, for each bait polypeptide, several prey polypeptides may be identified by 
comparing and selecting the intersection of every isolated fragment that are included 

5 in the same polypeptide. Thus the SID® polynucleotides of the present invention are 
represented by the shared nucleic acid sequences of SEQ ID Nos. 34 to 771 
encoding the SID® polypeptides of SEQ ID Nos. 772 to 1509 in columns 5 and 7 of 
Table 3, respectively. 

The present invention is not limited to the SID® sequences as described in the 

10 above paragraph, but also includes fragments of these sequences having at least 12 
consecutive nucleic acids, between 12 and 5,000 consecutive nucleic acids and 
between 12 and 10,000 consecutive nucleic acids and between 12 and 20,000 
consecutive nucleic acids, as well as variants thereof. The fragments or variants of 
the SID® sequences possess at least the same affinity of binding to its protein or 

15 polypeptide counterpart, against which it has been initially selected. Moreover this 
variant and/or fragments of the SID® sequences alternatively can have between 95% 
and 99.999% sequence identity to its protein or polypeptide counterpart. 

According to the present invention variants of polynucleotide or polypeptides can 
be created by known mutagenesis techniques either in vitro or in vivo. Such a 

20 variant can be created such that it has altered binding characteristics with respect to 
the target protein and more specifically that the variant binds the target sequence 
with either higher or lower affinity. 

Polynucleotides that are complementary to the above sequences which include 
the polynucleotides of the SID®'s, their fragments, variants and those that have 

25 specific sequence identity are also included in the present invention. 

The polynucleotide encoding the SID® polypeptide, fragment or variant thereof 
can also be inserted into recombinant vectors which are described in detail above. 

The present invention also relates to a composition comprising the above- 
mentioned recombinant vectors containing the SID® polynucleotides in Table 3, 

30 fragments or variants thereof, as well as recombinant host cells transformed by the 
vectors. The recombinant host cells that can be used in the present invention were 
discussed in greater detail above. 
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The compositions comprising the recombinant vectors can contain physiological 
acceptable carriers such as diluents, adjuvants, excipients and any vehicle in which 
this composition can be delivered therapeutically and can include, but is are not 
limited to sterile liquids such as water and oils. 

In yet another embodiment, the present invention relates to. a method of selecting 
modulating compounds, as well as the modulating molecules or compounds 
themselves which may be used in a pharmaceutical composition. These modulating 
compounds may act as a cofactor, as an inhibitor, as antibodies, as tags, as a 
competitive inhibitor, a? an activator or alternatively have agonistic or antagonistic 
activity on the protein-protein interactions. 

The activity of the modulating compound does not necessarily, for example, have 
to be 100% activation or inhibition. Indeed, even partial activation or inhibition can be 
achieved that is of phamaaceutical interest. 

The modulating compound can be selected according to a method which 
comprises: 

cultivating a recombinant host cell with a modulating compound on a selective 
medium and a reporter gene the expression of which is toxic for said recombinant 
host cell wherein said recombinant host cell is transformed with two vectors: 
wherein said first vector comprises a polynucleotide encoding a firet hybrid 
polypeptide having a DNA binding domain; 

wherein said second vector comprises a polynucleotide encoding a second 
hybrid polypeptide having a transcriptional activating domain that activates said 
toxic reporter gene when the first and second hybrid polypeptides interact; 
selecting said modulating compound which inhibits or pemilts the growth of said 
recombinant host cell. 

Thus, the present invention relates to a modulating compound that inhibits the 
protein-protein interactions of a complex of two polypeptides of columns 1 and 4 of 
Table 2. 

The present invention also relates to a modulating compound that activates the 
protein-protein interactions of a complex of two polypeptides of columns 1 and 4 of, 
Table 2. 
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In yet another embodiment, the present invention relates to a method of selecting 
a modulating compound, which modulating compound Inhibits the Interactions of two 
polypeptides of columns 1 and 4 of Table 2. This method comprises: 

cultivating a recombinant host cell with a modulating compound on a selective 
medium and a reporter gene the expression of which is toxic for said recombinant 
host ceil wherein said recombinant host cell is transformed with two vectors: 
wherein said first vector comprises a polynucleotide encoding a first hybrid 
polypeptide having a first domain of an enzyme; 

wherein said second vector comprises a polynucleotide encoding a second 
hybrid polypeptide having an enzymatic transcriptional activating domain that 
activates said toxic reporter gene when the first and second hybrid polypeptides 
interact; 

selecting said modulating compound which inhibits or pemfiits the growth of said 
recombinant host cell. 

In the two methods described above any toxic reporter gene can be utilized 
including those reporter genes that can be used for negative selection including the 
URA3 gene, the CYH1 gene, the CYH2 gene and the like. 

In yet another embodiment, the present invention provides a kit for screening a 
modulating compound. This kit comprises a recombinant host cell which comprises a 
reporter gene the expression of which is toxic for the recombinant host cell. The host 
cell is transfonned with two vectors. The first vector comprises a polynucleotide 
encoding a first hybrid polypeptide having a DNA binding domain; and the second 
vector comprises a polynucleotide encoding a second hybrid polypeptide having a 
transcriptional activating domain that activates said toxic reporter gene when the first 
and second hybrid polypeptides interact. 

In yet another embodiment, a kit is provided for screening a modulating 
compound by providing a recombinant host cell, as described in the paragraph 
above, but instead of a DNA binding domain, the first vector encodes a first hybrid 
polypeptide containing a first domain of a protein. The second vector encodes a 
second polypeptide containing a second part of a complementary domain of a protein 
that activates the toxic reporter gene when the first and second hybrid polypeptides 
Interact. 
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In the selection methods described above, the activating domain can be p42 Gal 
4, YP16 (HSV) and the DNA-binding domain can be derived from Gal4 or Lex A. The 
protein or enzyme can be adenylate cyclase, guanylate cyclase, DHFR and the like. 
Examples of modulating compounds are set forth in Table 3. 

5 In yet another embodiment, the present invention relates to a pharmaceutical 

composition comprising the modulating compounds for preventing or treating obesity 
or metabolic diseases in a human or animal, most preferably in a mammal. 

This phannaceutical composition comprises a phannaceutically acceptable 
amount of the modulating compound. The pharmaceutically acceptable amount can 

10 be estimated from cell culture assays. For example, a dose can be formulated in 
animal models to achieve a circulating concentration range that includes or 
encompasses a concentration point or range having the desired effect in an in vitro 
system. This information can thus be used to accurately determine the doses in 
other mammals, Including humans and animals. 

15 The therapeutically effective dose refers to that amount of the compound that 

results in amelioration of symptoms in a patient Toxicity and therapeutic efficacy of 
such compounds can be determined by standard pharmaceutical procedures in cell 
cultures or in experimental animals. For example, tiie LD50 (the dose lethal to 50% 
of the population) as well as the ED50 (the dose tiierapeutically effective in 50% of 

20 the population) can be determined using methods known In the art. The dose ratio 
betyveen toxic and therapeutic effects is the therapeutic index which can be 
expressed as the ratio between LD50 and ED50 compounds tiiat exhibit high 
therapeutic indexes. 

The data obtained from the cell culture and animal studies can be used in 
25 formulating a range of dosage of such compounds which lies preferably within a 
range of circulating concentrations that include the ED50 with little or no toxicity. 

The pharmaceutical composition can be administered via any route such as 
locally, orally, systemically, intravenously, intramusculariy, mucosally. using a patch 
and can be encapsulated in liposomes, microparticles, microcapsules, and the like. 
30 The pharmaceutical composition can be embedded in liposomes or even 
encapsulated. 

Any pharmaceutically acceptable carrier or adjuvant can be used in the 
pharmaceutical composition. The modulating compound will be preferably in a 
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soluble form combined with a phamiaceutically acceptable carrier. The techniques 
for formulating and administering these compounds can be found in "Remington's 
Phannaceutical Sciences'* Maok Publication Co., Easton, PA, latest edition. 

The mode of administration optimum dosages and galenic forms can be 
5 determined by the criteria known in the art taken into account the seriousness of the 
general condition of the mammal, the tolerance of the treatment and the side effects. 

The present invention also relates to a method of treating or preventing obesity or 
metabolic diseases in a human or mammal in need of such treatment. This method 
comprises administering to a mammal in need of such treatment a pharmaceutically 
10 effective amount of a modulating compound which binds to a targeted mammalian or 
human or adipocyte protein. In a preferred embodiment, the modulating compound 
is a polynucleotide which may be placed under the control of a regulatory sequence 
which is functional in the mammal or human. 

In yet another embodiment, the present invention relates to a pharmaceutical 
15 composition comprising a SID® polypeptide, a fragment or variant thereof. The SID® 
polypeptide, fragment or variant thereof can be used in a pharmaceutical composition 
provided that it is endowed with highly specific binding properties to a bait 
polypeptide of interest. 

The original properties of the SID® polypeptide or variants or fragments thereof 
20 interfere with the naturally occurring interaction between a first protein and a second 
protein within the cells of the organism. Thus, the SID® polypeptide binds 
specifically to either the first polypeptide or the second polypeptide. 

Therefore, the SID® polypeptides of the present Invention or variants or 
fragments thereof interfere with protein-protein interactions between mammalian or 
25 human adipocyte proteins. 

Thus, the present invention relates to a phannaceutical composition comprising a 
phamiaceutically acceptable amount of a SID® polypeptide or variant or fragment 
thereof, provided that the variant has the above-mentioned two characteristics; i.e., 
that it is endowed with highly specific binding properties to a bait polypeptide of 
30 interest and is devoid of biological activity of the naturally occurring protein. 

In yet another embodiment, the present invention relates to a pharmaceutical 
composition comprising a pharmaceutically effective amount of a polynucleotide 
encoding a SID® polypeptide or a variant thereof wherein the polynucleotide is 
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placed under the control of an appropriate regulatory sequence. Appropriate 
regulatory sequences that are used are polynucleotide sequences derived from 
promoter elements and the like. 

Polynucleotides that can be used in the pharmaceutical composition of the 
present invention include the nucleotide sequences of SEQ ID Nos.34 to 771 . 

Besides the SID® polypeptides and polynucleotides, the pharmaceutical 
composition of the present invention can also include a recombinant expression 
vector comprising the polynucleotide encoding the SID® polypeptide, fragment or 
variant thereof. 

The above described pharmaceutical compositions can be administered by any 
route such as orally, systemically, intravenously, intramusculariy, intradermally, 
mucosally, encapsulated, using a patch and the like. Any phannaceutically 
acceptable earner or adjuvant can be used in this phanmaceutical composition. 

The SID® polypeptides as active ingredients will be preferably in a soluble form 
combined with a pharmaceutically acceptable carrier. The techniques for fonnulating 
and administering these compounds can be found in "Remington's Pharmaceutical 
Sciences^ supra. 

The amount of pharmaceutically acceptable SID® polypeptides can be 
determined as described above for the modulating compounds using cell culture and 
animal models. 

Such compounds can be used in a phamnaceutical composition to treat or 
prevent obesity or any metabolic diseases. 

Thus, the present invention also relates to a method of preventing or treating 
obesity or any metabolic diseases in a mammal sakl method comprising the steps of 
administering to a mammal in need of such treatment a pharmaceutically effective 
amount of: 

(1) a SID® polypeptide of SEQ ID Nos. 772 to 1509 or a variant or a fragment 
thereof which binds to a targeted mammalian or human adipocyte protein; or 

(2) SID® polynucleotide encoding a SID® polypeptide of SEQ ID Nos. 772 to 1 509 
or a variant or a fragment thereof wherein said polynucleotide is placed under the 
control of a regulatory sequence which is functional in said mammal or human; or 

(3) a recombinant expression vector comprising a polynucleotide encoding a 
SID® polypeptide which binds to a mammalian, human adipocyte protein. 

25 
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In another embodiment the present invention nucleic acids comprising a sequence 
of SEQ ID Nos. 34 to 771 which encodes the protein of sequence SEQ ID Nos. 772 
to 1509 and/or functional derivatives thereof are administered to modulate complex 
(from Table 2) function by way of gene therapy. Any of the methodologies relating to 
gene therapy available within the art may be used in the practice of the present 
invention such as those described by Goldspiel et al Clin, Pharm. 12 pgs. 488-505 
(1993). 

Delivery of the therapeutic nucleic acid into a patient may be direct in vivo gene 
therapy (i.e.. the patient is directly exposed to the nucleic acid or nucleic acid- 
containing vector) or indirect ex vivo gene therapy (i.e., cells are first transfonmed 
with the nucleic acid in vitro and then transplanted Into the patient). 

For example for In vivo gene therapy, an expression vector containing the nucleic 
acid is administered in such a manner that It becomes Intracellular, i.e., by infection 
using a defective or attenuated retroviral or other viral vectors as described, for 
example In U.S. Patent 4,980,286 or by Robbins et al, PhamiacoL Then , 80 No. 1 
pgs. 35-47 (1998). 

The various retroviral vectors that are known in the art are such as those 
described in Miller et aL {Meth. Enzymol. 217 pgs. 581-599 (1993)) which have been 
modified to delete those retroviral sequences which are not required for packaging of 
the viral genome and subsequent integration into host cell DNA. Also adenoviral 
vectors can be used which are advantageous due to their ability to infect non-dividing 
cells and such high-capacity adenoviral vectors are described In Kochanek {Human 
Gene Therapy, 10, pgs. 2451-2459 (1999)). Chimeric viral vectors that can be used 
are those described by Reynolds et al. {Molecular Medecine Today, pgs. 25 -31 
(1999)). Hybrid vectors can also be used and are described by Jacoby et al. {Gene 
Therapy. 4. pgs. 1 282-1 283 (1 997)). 

Direct injection of naked DNA or through the use of microparticle bombardment 
(e.g., Gene Gun®; Biolistic, Dupont) or by coating it with lipids can also be used in 
gene therapy. Cell-surface receptors/transfecting agents or through encapsulation in 
liposomes, microparticles or microcapsules or by administering the nucleic acid in 
linkage to a peptide which is known to enter the nucleus or by administering it in 
linkage to a ligand predisposed to receptor-mediated endocytosis (See Wu & Wu, J. 
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Biol. Chem., 262 pgs. 4429-4432 (1987)) can be used to target cell types which 
specifically express the receptors of interest. 

In another embodiment a nucleic acid ligand compound may be produced in which 
the ligand comprises a fusogenic viral peptide designed so as to disrupt endosomes, 
thus allowing the nucleic acid to avoid subsequent lysosomal degradation. The 
nucleic acid may be targeted in vivo for cell specific endocytosis and expression by 
targeting a specific receptor such as that described in WO92/06180, W093/14188 
and WO 93/20221 . Alternatively the nucleic acid may be introduced intracellularly 
and incorporated within the host cell genome for expression by homologous 
recombination (See Zijistra et al, Nature, 342, pgs. 435-428 (1 989)). 

In ex vivo gene therapy, a gene is transferred into cells in vitro using tissue 
culture and the cells are delivered to the patient by various methods such as injecting 
subcutaneously, application of the cells into a skin graft and the intravenous injection 
of recombinant blood cells such as hematopoietic stem or progenitor ceils. 

Cells into which a nucleic acid can be introduced for the purposes of gene 
therapy include, for example, epithelial cells, endothelial cells, keratinocytes, 
fibroblasts, muscle cells, hepatocytes and blood cells. The blood cells that can be 
used include, for example. T-lymphocytes, B-lymphocytes, monocytes, 
macrophages, neutrophils, eosinophils, megakaryotcytes, granulocytes, 
hematopoietic cells or progenitor cells and the like. 

in yet another embodiment the present invention relates to protein chips or 
protein microarrays. It is well known in the art that microarrays can contain more 
than 10,000 spots of a protein that can be robotically deposited on a surface of a 
glass slide or nylon filter. The proteins attach covaiently to the slide surface, yet 
retain their ability to interact with other proteins or small molecules in solution. In 
some instances the protein samples can be made to adhere to glass slides by 
coating the slides with an aldehyde-containing reagent that attaches to primary 
amines. A process for creating microarrays is described, for example by MacBeath 
and Schreiber {Science, Volume 289, Number 5485, pgs, 1760-1763 (2000)) or 
(Service. Science, Vol, 289. Number 5485 pg. 1673 (2000)). An apparatus for 
controlling, dispensing and measuring small quantities of fluid is described, for 
example, in U.S. Patent No. 6,1 12,605. 
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The present invention also provides a record of protein-protein interactions, 
PIM®*s and any data encompassed in the following Tables. It will be appreciated 
that this record can be provided in paper or electronic or digital form. 

In order to fully illustrate the present invention and advantages thereof, the 
following specific examples are given, it being understood that the same are intended 
only as illustrative and in nowise limitative. 

EXAMPLES 

EXAIWPLE 1 : Preparation of a collection of random-primed cDNA fragments 

1A. Collection oreDamtion and transformation in Escherichia coll 

1 .A.1 . Random-primed cDNA fraoment preoaration 

For mRNA sample from differentiated PAZ6 adipocytes, random-primed cDNA 
was prepared from 5 pg of polyA+ mRNA using a TimeSaver cDNA Synthesis Kit 
(Amersham Phamiacia Biotech) and with 5 pg of random N9-mers according to the 
manufacturer's instmctions. Following phenolic extraction, the cDNA was 
precipitated and resuspended in water. The resuspended cDNA was phosphorylated 
by incubating in the presence of T4 DNA Kinase (Biolabs) and ATP for 30 minutes at 
37°C. The resulting phosphorylated cDNA was then purified over a separation 
column (Chromaspin TE 400, Clontech). according to the manufacturer's protocol. 

1 .A.2, Ligation of linkers to blunt-ended cDNA 

Oligonucleotides HGX931 (5' end phosphorylated) 1 ng/jJ and HGX932 l^g/jJ 
were used. 

Sequence of the oligo HGX931: 5'-GGGCCACGAA-3' (SEQ ID NO. 1510) 
Sequence of the oligo HGX932: 5'-TTCGTGGCCCCTG-3 '(SEQ ID NO. 151 1) 
Linkers were preincubated (5 minutes at 95''C, 10 minutes at 68°G, 15 minutes at 
42**C) then cooled down at room temperature and ligated with cDNA fragments at 
16**Covemight. 

Linkers were removed on a separation column (Chromaspin TE 400, Clontech), 
according to the manufacturer's protocol. 
1 .A,3. Vector preparation 

Plasmid pP6 (see Figure 10) was prepared by replacing the SpelfXhot fragment 
of pGAD3S2X with the double-stranded oligonucleotide: 
5'- 

CTAGCCATGGCCGCAGGGGCCGCGGCCGCACTAGTGGGGATCCTTAATTAAAG 
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GGCCACTGGGGCCCCCGGTACCGGCGTCCCCGGCGCCGGCGTGATCACCCCT 
AGGAATTAATTTCCCGGTGACCCCGGGGGAGCT-3' (SEQ ID NO. 1512) 

The pP6 vector was successively digested with Sffi and BamH\ restriction 
enzymes (Biolabs) for 1 hour at 37°C, extracted, precipitated and resuspended in 
water. Digested plasmid vector backbones were purified on a separation column 
(Chromaspin TE 400. Clontech), according to the manufacturer's protocol. 

I.A.4. Ligation between vector and insert of cDNA 

The prepared vector was ligated overnight at 15X with the blunt-ended cDNA 
described in section 2 using T4 DNA ligase (Biolabs). The DNA was then 
precipitated and resuspended in water. 

1 .A.5, Library transfomriation in Escherichia coli 

The DNA from section 1A4 was transformed into Electromax DH10B 
electrocompetent cells (Gibco BRL) with a Cell Porator apparatus (Gibco BRL). 1 ml 
SOC medium was added and the transfonned cells were incubated at 37*^0 for 1 
hour. 9 mis of SOC medium per tube was added and the cells were plated on 
LB+ampicillln medium. The colonies were scraped with liquid LB medium, allquoted 
and frozen at -80*'C. 

The obtained collection of recombinant cell clones was named HGXBPZDRP1. 

1.B. Collection transformation in Saccharomvces cetevisiae 

The Saccharomyces cerevisiae strain (Y187 (MATa Gal4A GalBOA ade2-101, 
his3, leu2-3, -112, trpl-901, ura3-52 URA3::UASGAL1-LacZ Met)) was transformed 
with the cDNA library. 

The plasmid DNA contained in E. coli were extracted (Qiagen) from aliquoted E. 
coli frozen cells (1.A.5.). Saccharomyces cerevisiae yeast Y187 in YPGIu were 
grown. 

Yeast transfonnation was performed according to standard protocol (Glest et ai 
Yeast, 11, 355-360, 1995) using yeast earner DNA (Clontech). This experiment lead 
to 10"* to 5 X 10"^ celis/jig DNA. 2x10"^ cells were spread on DO-Leu medium per 
plate. The cells were aliquoted into vials containing 1 ml of cells and frozen at -80*'C. 
The obtained collection of recombinant cell clones was named HGXYPZDRP1 . 

IC. Construction of bait olasmids 

For fusions of the bait protein to the DNA-binding domain of the GAL4 protein of 
S. cerevisiae, bait fragments were cloned into plasmid pB6. For fusions of the bait 
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protein to the DNA-binding domain of the LexA protein of E. co//, bait fragments were 

cloned into plasmid pB20. 

Plasmid pB6 (see Figure 3) was prepared by replacing the NcoMSah polylinker 

fragment of pB1 (see Figure 1 ) with the double-stranded DNA fragment: 

5' CATGGCCGGACGGGCCGCGGCCGCACTAGTGGGGATCCTTA 
ATTAAGGGCCACTGGGGCCCC 3' (SEQ ID No. 1513) 

5'TCGAGGGGCCCCAGTGGCCCTTAATTAAGGATCGCCACTAGTG 
CGGCCGCGGCCCGTCCGGC 3' (SEQ ID No. 1514) 

Plasmid pB5 (see Figure 2) was prepared by replacing the Ncol/Sall polylinker 

fragment of pB1 with the double-stranded DNA fragment : 

5' CATGGCCGCAGGGGCCGCGGCCGCACTAGTGGGGATCCTTA 
ATTAAGGGCCACTGGGGCCCC 3' (SEQ IS No. 1515) 
5* TCGAGGGGCCCCAGTGGCCCTTAATTAAGGATCCCCACTAGTG 
CGGCCGCGGCCCCTGCGGC 3' (SEQ ID No. 1516) 

The amplification of the bait ORF was obtained by PGR using the Pfu proof- 
reading Taq polymerase (Stratagene), 10 pmol of each specific amplification primer 
and 200 ng of plasmid DNA as template. 
The PCR program was set up as follows : 
94" 45" 
94" 45" ^ 



72" 6' ^ 
72" 10' 

15" 00 

The amplification was checked by agarose gel electrophoresis. 
The PCR fragments were purified with Qiaquick column (Qiagen) according to the 
manufacturer's protocol. 

Purified PCR fragments were digested with adequate restriction enzymes. 
The PCR fragments were purified with Qiaquick column (Qiagen) according to the 
manufacturer's protocol. 

The digested PCR fragments were ligated into an adequately digested and 
dephosphorylated bait vector (pB6 or pB5) according to standard protocol (Sambrook 
et al) and were transfonmed into competent bacterial cells. The cells were grown, the 
DNA extracted and the plasmid was sequenced. 

Example 2 : Screening the collection with the two-hybrid in yeast system 
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2A, The mating protocol 

The mating two-hybrid in yeast system (as described by Legrain et ai. Nature 
Genef/cs, vol. 16, 277-282 (1997), Toward a functional analysis of the yeast genome 
through exhaustive two-hybrid screens) was used for its advantages but one could 
also screen the cDNA collection in classical two-hybrid system as described in Fields 
et ai or in a yeast reverse two-hybrid system. 

The mating procedure allows a direct selection on selective plates because the 
two fusion proteins are already produced in the parental cells. No replica plating is 
required. 

This protocol was written for the use of the library transfomied into the Y187 
strain. 

For bait proteins fused to the DNA-binding domain of GAL4, bait-encoding 
plasmids were first transfonmed into S. cerevisiae (CG1945 strain (MATa Gal4-542 
Gal 1 80-538 ade2-1 01 hls3A200. Ieu2-3,1 1 2. trp1 -901 , ura3-52. Iys2-801 , 
URA3::GAL4 17mers (X3)-CyC1TATA-LacZ, LYS2::GAL1UAS-GAL1TATA-HIS3 
CYH'^)) according to step I.B. and spread on DO-Trp medium. 

For bait proteins fused to the DNA-blnding domain of LexA, bait-encoding 
plasmids were first transformed Into S. cerevlsiae (L40Agal4 strain (MATa ade2, trpl- 
901, Ieu2 3,112. Iys2-801, hls3A200, LYS2::(lexAop)4-HIS3. ura3.52::URA3 
(lexAop)8-LacZ, GAL4::Kan'^)) according to step I.B. and spread on DO-Trp medium- 
Day 1, morning : preculture 

The cells carrying the bait plasmid obtained at step 1 .C. were precultured in 
20 ml DO-Trp medium and grown at 30**C with vigorous agitation. 
Day 1, late afternoon : culture 

The ODgQon^ of the DO-Trp pre-culture of cells carrying the bait plasmid was 
measured. The ODeoonm must lie between 0.1 and 0.5 in order to conrespond to a 
linear measurement. 

50 ml DO-Trp at ODeoOnm 0.006/ml was inoculated and grown overnight at 30''C 
with vigorous agitation. 
Day 2 : mating 
medium and plates 
1 YPGIu 15cm plate 
50 ml tube with 13 ml DO-Leu-Trp-His 
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1 00 ml flask with 5 ml of YPGIu 
8 DO-Leu-Trp-His plates 
2 DO-Leu plates 
2 DO-Trp plates 
5 2 DO-Leu-Trp plates 

The ODeoOnm of the DO-Trp culture was measured. It should be around 1. 
For the mating, twice as many bait cells as library cells were used. To get a good 
mating efficiency, one must collect the cells at 10^ cells per cm^. 

The amount of bait culture (in ml) that makes up 50 ODeoOnm units for the 
10 mating with the prey library was estimated. 

A vial containing the HGXYPZDRP1 library was thawed slowly on ice. 1.0ml of 
the vial was added to 5 ml YPGIu. Those cells were recovered at SO^'C, under gentle 
agitation for 10 minutes. 
l\/latina 

15 The 50 ODeoOnm units of bait culture was placed into a 50 ml falcon tube. 

The HGXYCDNA1 library culture was added to the bait culture, then centrifuged, the 

supernatant discarded and resuspended in 1.6ml YPGIu medium. 

The cells were distributed onto two 15cm YPGIu plates with glass beads. The 

cells were spread by shaking the plates. The plate cells-up at 30°C for 4h30min were 
20 incubated. 

Collection of mated cells 

The plates were washed and rinsed with 6ml and 7ml respectively of DO-Leu- 

Trp-His. Two parallel serial ten-fold dilutions were perfomned in SOOpI DO-Leu-Trp- 

His up to 1/10,000. 50mI of each 1/10000 dilution was spread onto DO-Leu and DO- 
25 trp plates and 50ijl of each 1/1000 dilution onto DO-Leu-Trp plates. 22.4ml of 

collected cells were spread in 400pl aliquots on DO-Leu-Trp-His+Tet plates. 

Day 4 

Clones that were able to grow on DO-Leu-Trp-His+Tetracyclln were then 
selected. This medium allows one to isolate diploid clones presenting an interaction. 
30 The His+ colonies were counted on control plates. 

The number of His+ cell clones will define which protocol is to be processed : 
Upon 60 X 1 0® Trp+Leu+ colonies : 
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if the number His+ cell clones <285 : then use the process luminometry protocol on 
all colonies 

if the number of His+ cell clones >285 and <5000: then process via overlay and then 
luminometry protocols on blue colonies (2.B and 2.C). 

if number of His+ cell clones >5000 : repeat screen using DO-Leu-Trp- 
His+Tetracyclin plates containing 3-aminotriazol. 
2.5. The X'Gal overlay assay 

The X-Gal overlay assay was performed directly on the selective medium plates 

after scoring the number of His* colonies. 

Materials 

A waterbath was set up. The water temperature should be 50**C. 
0.5 M Na2HP04 pH 7.5. 
1.2%Bacto-agar. 
2% X-Gal in DMF. 

Overlay mixture : 0.25 M Na2HP04 pH7.5. 0.5% agar. 0.1% SDS, 7% DMF 
(LABOSI), 0.04% X-Gal (ICN). For each plate, 10 ml overlay mixture are needed. 
DO-Leu-Trp-His plates. 
Sterile toothpicks. 
Experiment 

The temperature of the overlay mix should be between AS^'C and 50**C. The 
overlay-mix was poured over the plates in portions of 1 0 ml. When the top layer was 
settled, they were collected. The plates were incubated overlay-up at SO^'C and the 
time was noted. Blue colonies were checked for regularly. If no blue colony 
appeared, overnight incubation was perfomied. Using a pen the number of positives 
was marked. The positives colonies were streaked on fresh DO-Leu-Trp-His plates 
with a sterile toothpick. 
2.C. The luminometry assay 

His+ colonies were grown overnight at 30°C in microtiter plates containing DO- 
Leu-Trp-His+Tetracyclin medium with shaking. The day after, the overnight culture 
was diluted 15 times into a new microtiter plate containing the same medium and 
was incubated for 5 hours at 30°C with shaking. The samples were diluted 5 times 
and read ODeoonm- The samples were diluted again to obtain between 10,000 and 
75,000 yeast cells/well in 100 ^il final volume. 
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Per well, 76 |xl of One Step Yeast Lysis Buffer (Tropix) was added, 20 ^1 
Sapphirell Enhancer (Tropix). 4 fil Galacton Star (Tropix) and incubated 40 minutes 
at sec. The p-Gal read-out (L) was measured using a Luminometer (Trilux, 
Wallach). The value of (ODeoonmX L) was calculated and interacting preys having the 
highest values were selected. 

At this step of the protocol, diploid cell clones presenting interaction were 
isolated. The next step was now to identify polypeptides involved in the selected 
interactions. 

Example 3 : Identification of positive clones 
3.A. PCR on veast colonies 
Introduction 

PCR amplification of fragments of plasmid DNA directly on yeast colonies is a 
quick and efficient procedure to identify sequences cloned into this plasmid. It is 
directly derived from a published protocol (Wang H. et al.. Analytical Biochemistry. 
237,145-146,(1996)). However, it is not a standardized protocol and it varies from 
strain to strain and it is dependent of experimental conditions (number of cells, Taq 
polymerase source, etc). This protocol should be optimized to specific local 
conditions. 
Materials 

For 1 well, PCR mix composition was : 
32.5 ^il water, 

5 nl 1 0X PCR buffer (Phannacia), 
1 ^ildNTPIOmM. 

0.5 \i\ Taq polymerase (5u/pl) (Pharmacia), 

0.5 jJ oligonucleotide ABS1 10pmole/nl: 5'-GCGTTTGGAATCACTACAGG-3*,(SEQ 
ID No.1517) 

0.5 oligonucleotide ABS2 10pmole/^l: 5'-CACGATGCACGTTGAAGTG-3'.(SEQ 
ID No. 1518) 
1 N NaOH. 
Experiment 

The positive colonies were grown ovemight at 30°C on a 96 well cell culture 
cluster (Costar), containing 150^1 DO-Leu-Trp-His+Tetracyclin with shaking. The 
culture was resuspended and 100 nl was transfen-ed immediately on a Themnowell 
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96 (Costar) and centrifuged for 5 minutes at 4,000 rpm at room temperature. The 
supernatant was removed. 5 ix\ NaOH was added to each well and shaken for 
1 minute. 

The Thermowell was placed in the thermocycler (GeneAmp 9700, Perkin Elmer) 
5 for 5 minutes at 99.9°C and then 10 minutes at 4X. In each well, the PGR mix was 
added and shaken well. . 

The PGR program was set up as followed : 

94*'G 3 minutes 

94X 30 seconds ^ 

10 53*'G 1 minute 30 seconds p=— x 35 cycles 

72''C 3 minutes 

72*'G 5 minutes 

IS'^G 00 

The quality, the quantity and the length of the PGR fragment was checked on an 
15 agarose gel. The length of the cloned fragment was the estimated length of the PGR 
fragment minus 300 base pairs that con-esponded to the amplified flanking plasmid 
sequences. 

3.B. Plasmids rescue from veast bv electroporation 
Introduction 

20 The previous protocol of PGR on yeast cell may not be successful, in such a 

case, plasmids from yeast by electroporation can be rescued. This experiment 
allows the recovery of prey plasmids from yeast cells by transformation of E. coH with 
a yeast cellular extract. The prey plasmid can then be amplified and the cloned 
fragment can be sequenced. . 

25 Materials 

Plasmid rescue 
Glass beads 425-600 |Lim (Sigma) 

Phenol/chloroform (1/1) premixed with isoamyl alcohol (Amresco) 
Extraction buffer : 2% Triton X100. 1% SDS, 100 mM NaGI, 10 mM TrisHGI pH 8.0, 
30 1 mM EDTA pH 8.0. 

Mix ethanol/NH4Ac : 6 volumes ethanol with 7.5 M NH4 Acetate, 70% Ethanol and 
yeast cells in patches on plates. 
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Electro poration 
SOC medium 
M9 medium 

Selective plates : M9-Leu+Ampicillin 

2 mm electroporation cuvettes (Eurogentech) 

Experiment 

Plasmid rescue 

The cell patch on DO-Leu-trp-His was prepared with the cell culture of 
section 2.C. The cell of each patch was scraped into an Eppendorf tube, 300 |il 
of glass beads was added In each tube, then, 200 \si extraction buffer and 200 ^1 
phenohchlorofonnrisoamyl alcohol (25:24:1) was added. 
The tubes were centrifuged for 10 minutes at 15,000 rpm. 
180 |xl supematant was transferred to a sterile Eppendorf tube and 500 \i\ each of 
ettianol/NH4Ac was added and the tubes were vortexed. The tubes were 
centrifuged for 15 minutes at 15,000 rpm at 4X. The pellet was washed with 
200 pJ 70% ethanol and the ethanol was removed and the pellet was dried. The 
pellet was resuspended in 10 |il water. Extracts were stored at -20*C. 
Electroporation 

Materials : Electrocompetent MC1066 cells prepared according to standard protocols 
(Sambrook et al. supra). 

1 of yeast plasmid DNA-extract was added to a pre-chilled Eppendorf tube, and 
kept on ice. 

1 |xl plasmid yeast DNA-extract sample was mixed and 20 ^1 electrocompetent cells 
was added and transfen-ed in a cold electroporation cuvette. 
The Biorad electroporator was set on 200 ohms resistance, 25 jiF capacity; 2.5 kV. 
The cuvette was placed in the cuvette holder and electroporation was performed. 
1 ml of SOC was added into the cuvette and the cell-mix was transfenred into a sterile 
Eppendorf tube. The cells were recovered for 30 minutes at 37**C, then spun down 
for 1 minute at 4,000 x g and the supematant was poured off. About 100 pi medium 
was kept and used to resuspend the cells and spread them on selective plates (e.g., 
M9-Leu plates). The plates were then Incubated for 36 hours at 37*^0. 
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One colony was grown and the plasmids were extracted. The presence and the 
size of the insert were checked for through enzymatic digestion and agarose gel 
electrophoresis. The insert was then sequenced. 
Example 4 : Protein-protein interaction 
5 For each bait, the previous protocol lead to the identification of prey 

polynucleotide sequences. Using a suitable software program (e.g., Blastwun. 
available on the Internet site of the University of Washington 
http://bioweb.pasteur.fr/seQanal/lnterfaces/blastwu.html ) one can determine the 
identity of the mRNA transcript that Is encoded by the prey fragment and whether the 

10 fusion protein encoded is in the same open reading frame of translation as the 
predicted protein or not. 

Alternatively, prey nucleotide sequences can be compared with one another and 
those which share identity over a significant region (60ht) can be grouped together to 
form a contiguous sequence (Contig) whose identity can be ascertained In the same 

IS manner as for individual prey fragments described above. 
Example 5: Identification of SID® 

By comparing and selecting the intersection of all isolated fragments that are 
included in tiie same polypeptide, one can define the Selected Interacting 
20 Domain (SID®) is determined as illusti-ated in Figure 15. The SID® Is illustrated 

in Table 3. 

Example 6 : Mailing of polyclonal and monoclonal antibodies 

The protein-protein complex of columns 1 and 4 of Table 2 is injected into mice 
and polyclonal and monoclonal antibodies are made following the procedure set forth 

25 in Sambrook et al supra. 

More specifically, mice are immunized with an immunogen comprising the above 
mentionned complexes conjugated to keyhole limpet hemocyanin using 
glutaraldehyde or EDC as is well known in the art. The complexes can also be 
stabilized by crosslinking as described in WO 00/37483. The immunogen is then 

30 mixed with an adjuvant. Each mouse receives four injections of 10 pg to 100 pg of 
immunogen, and after the fourth injection, blood samples are taken from the mice to 
determine if the serum contains antibodies to the immunogen. Serum titer is 
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determined by ELISA or RIA. Mice with sera indicating the presence of antibody to 
the immunogen are selected for hybridoma production. 

Spleens are removed from immune mice and single-cell suspension is prepared 
(Harlow et al 1988). Cell fusions are perfomned essentially as described by Kohler et 
al.. Briefly, P365.3 myeloma cells (ATTC Rockville, Md) or NS-1 myeloma cells are 
fused with spleen cells using polyethylene glycol as described by Hariow et al (1989). 
Cells are plated at a density of 2 x 10^ cells/well in 96-well tissue culture plates. 
Individual wells are examined for growth and the supematants of wells with growth 
are tested for the presence of complex-specific antibodies by ELISA or RIA using the 
protein-protein complex of columns 1 and 4 of Table 2 as a target protein. Cells in 
positive wells are expanded and subdoned to establish and confirm monoclonality. 

Clones with the desired specificities are expanded and grown as ascites in mice 
or in a hollow fiber system to produce sufficient quantities of antibodies for 
characterization and assay development. Antibodies are tested for binding to bait 
polypeptide of column 1 of Table 2 alone or to prey polypeptide of column 4 of Table 
2 alone, to detennine which are specific for the protein-protein complex of columns 1 
and 4 of Table 2 as opposed to those that bind to the individual proteins. 

Monoclonal antibodies against each of the complexes set forth in columns 1 and 
4 of Table 2 aro prepared in a similar manner by mixing specified proteins together, 
immunizing an animal, fusing spleen cells with myeloma cells and isolating clones 
which produce antibodies specific for the protein complex, but not for individual 
proteins. 

Exanfiple 6: Modulating compounds identification 

Each specific protein-protein complex of columns 1 and 4 of Table 2 is used to 
screen for modulating compounds. 

One appropriate constmction for this modulating compound screening is: 
bait polynucleotide inserted in pB6 or pB5; 
prey polynucleotide inserted in pP6; 

transformation of these two vectors in a pemrieable yeast cell; 
growth of the transfomied yeast cell on a medium containing compound to be 
tested, 

and observation of the growth of the yeast cells. 
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The following results obtained from these Examples, as well as the teachings in 
the specification are set forth in the Tables below. 

All non-patented websites cited in the present specification are incorporated 
herein by reference. 

While the invention has been described in terms of the various preferred 
embodiments, the skilled artisan will appreciate that various modifications, 
substitutions, omissions and changes may be made without departing from the scope 
thereof. Accordingly, it is intended that the present invention be limited by the scope 
of the following claims, including equivalents thereof. 
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CLAIMS 

What is claimed is: 

1 . A complex between two interacting proteins in adipocyte cells as defined in 
columns 1 and 4 in Table 2. 

2. A polynucleotide encoding a polypeptide in adipocyte cells as defined in 
columns 1 and 4 in Table 2. 

3. A recombinant host cell expressing at least one of the interacting polypeptides 
of said complex of claim 1 . 

4. A method for selecting a modulating compound in adipocyte cells comprising: 

(a) cultivating a recombinant host cell on a selective medium containing a 
modulating compound and a reporter gene the expression of which Is toxic 
for said recombinant host cell wherein said recombinant host cell is 
transfomied with two vectors: 

(i) wherein said first vector comprises a polynucleotide encoding a 
first hybrid polypeptide and a DNA bonding domain; 

(ii) wherein said second vector comprises a polynucleotide 
encoding a second hybrid polypeptide and an activating 
domain that activates said toxic reporter gene when the first 
and second hybrid polypeptides interact; 

(b) selecting said modulating compound which inhibits the growth of said 
recombinant host cell. 

5. A modulating compound obtained from the method of Claim 4. 

6. A SID® polypeptide comprising the SEQ ID Nos. 772 to 1509. 

7. A SID® polynucleotide comprising the SEQ ID Nos. 34 to 771. 

8. A vector comprising the SID® polynucleotide comprising the SEQ ID Nos. 34 
to 771. 

9. A fragment of said SID® polypeptide according to Claim 6. 

10. A variant of said SID® polypeptide according to Claim 6. 

1 1 - A fragment of said SID® polynucleotide according to Claim 7. 

12. A variant of said SID® polynucleotide according to Claim 7. 

13. A vector comprising the SID® polynucleotide according to Claim 1.1 . 

14. A vector comprising the SID® polynucleotide according to Claim 12. 
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15. A recombinant host cell containing tlie vectors according to Claim 8. 

16. A recombinant host cell containing the vectors according to Claim 13. 

17. A pharmaceutical composition comprising a modulating compound of claim 5 
and a pharmaceutically acceptable carrier. 

18. A pharmaceutical composition comprising a SID® polypeptide of SEQ ID 
Nos. 772 to 1 509 and a pharmaceutically acceptable carrier. 

19. A pharmaceutical composition comprising the recombinant host cells of Claim 

15 and a pharmaceutically acceptable canrier. 

20. A pharmaceutical composition comprising the recombinant host cells of Claim 

1 6 and a pharmaceutically acceptable carrier. 

21 .A protein chip comprising the polypeptides of Table 2. 

22. A record comprising all or part of the data set forth in Tables 1 and 2. 
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1/16 



pBl 



287 BfrBI 
287WS/I 
287 f^uKN 



AlwN\ 5760 



Sap\ 5226 



P5/ 1 4776 
Sa/I 4770 
- Bam HI 4764 
Xma\ 4759 
Sn»t 4759 
&y\ 4749 
/Vcol 4749 
Bst D&l 4749 
SffI 4747 
Msci 4746 

4570 




Alias : pAS2DD 
Application : 2HY (bait) 

Specificity: 
Selection : ampicillixi 
Constructed by: 



1285 Xcm\ 
1288 BseRl 



- 1748 BspBl 
1841 Pmf\ 

1956 ApaBi 
1956 as/ API 
2124 fisu 361 
2174 Bs/XI 



a^BI 4026 

fisa Bt 3797 



2680 Drain 
2986 Atoel 
2986 NgoMfV 
3289 /V/vl 
3293 Ed 13611 
3293 Sael 
3364 /l^/ul 
3465 Ppum 
3515 R/»AI 



Oligo 160 
gagagtagtaacaaaggtc 



AAAGACAGTTGACTGTATCGCCG GAA TTT AT 



Smal 



BamHI 



Sail 



PstI 



G GCC ATG .GAG GCC CCG GGG ATC CGT CGA CCT GCA QCC 
OUgo 161 



Ncol 

AAG CTA ATT 



ccgggcgaatttcttatg 



Oligo 160 5' GAGAGTAGTAACAAAGGTC 3' 
Oligo 161 5'CATAAGAAATTCGCCCGG3' 



FIGURE 1 



SUBSTITUTE SHEET (RULE 26) 
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2/16 

pB5 



Seal 6761 



2B7 B/Sr Bl 
2B7 Nsil 
237 f^lOI 



Bpm\ 636B 



>Ui¥Ml 5801 



Sapl 5267 " 



Psfl 4617 
fiffp120l 4805 
>V»( 4805 
•SffI 4797 
Pad 4788 
aamHI 4762 
^1 4774 
• Eagl 4767 
Noil 4766 
Sac 11 4764 
MiuU3i 4764 
4753 
SCKl 4749 
Ncot 4749 
Afecl 4746 
Hpal 4570 




Aiias: pAS2DDNSI 
ApplicatHin!2HY (hait) 
Backbone: pAS2DD 
Specificitv : Sfi non-oriented 
Selection ! ampicillin 
qpn^ctedbyiSW 



1285 Xcm I 
12B8S59RI 



1748 BspB 
1841 An/I 

195S>V»BI 
1956 BsfAPI 
2124 BSU3SI 
2174B5IXI 



2880 Oram 
2986 A/ael 
2986 iU^MIV 
3289 /V/vl 
3293 £c/ 13611 
3293 Sac\ 
3364 Mhrl 
3465 f^MI 
3515 Rto/U 



8sfBf 4026 

fisaBI 3797 



OKgo 160 

gagagtagtaacaaaggtcj AAAGACAGTTGACTGTATCGCCG GAA TTT ATG 

Sfi I Spel Bam HI 

G CC ATG OC C GCA GGG GCC GCG GCCGC A CTA GTG GGG ATC C 
Ncol NotI 

STOP Sfil PstI 

TT AAT|TAa | GGG CCA CTGGQG CCC CTC GAG CTG GAG CCA 
Pad 

OKgo 161 
AGC TAA TT jccgggcgaafflcttat^ 

Oligol60 5'GAGAGTAGTAACAAAGGTC3' 

Oligo 161 5* CATAAGAAATTCGCCCGG 3' figure 2 



SUBSTITUTE SHEET (RULE 26) 
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3/16 



pB6 



287 BfrB\ 
7Sr Nsil 

2sr ppum 



AlwNl 5801 



Sapi 5267 



Psfi 4617 
fiflplZOI 4805 
4805 
•Sffl 4797 
fte( 4786 
daoiHI 4782 
»g| 4774 
4767 
Afti/I 4766 
SacM 4764 
Mifliai 4764 
•Sffl 4m 
SfKl 4749 
/Vbol 4749 
ACscI 4746 
/ij^l 4570 



A pplication : 2HY (bait) 
Backbone: pAS2DD 
Specificity : Sfl oriented 
Selection: ampicfllm 
ConstnictedbvrSW 




1748 B2;p El 
1841 Pmn 

1958 >Mia Bt 
1956 BstAP\ 
2124 8stf 361 
2174 asfXI 



2880 OralN 
MBB AfaBi . 
'2986 NgohKN 
Z2B3 AArvl 
3233 £c/ 13611 
3293 Sact 
3364 Mfkil 
3465 /^Ml 
3515 



BsfBI 4026 
SsaBl 



3797 



Oligo 160 



gagagtagtaacaaaggtcAAAGACAGTTGACTGTATCGCCG GAATTT ATG 

Sac II 



Sfil 



Spel 



Bam HI 



GCC ATG GCC GGA CGG GCCGCG GCCGCACTA GTG GGG ATC C 



Ncol 

STOP 



TT AAT 



TAA 



Sfil 



Not I 
Apal 



PstI 



Pad 



AGC TAA TT 



GGG CCA CTGGGG CCC CTC GAC CTG CAGCCA 
OUgo 161 



ccgggcgaatttcttatg 



Oligo 160 5* GAGAGTAGTAACAAAGGTC3' 
Oligo 161 5'CATAAGAAATTCGCCCGG3' 

SUBSTITUTE SHEET (RULE 26) 



FIGURE 3 
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4/16 



pB13 



165 as/ Bl 



PpulOl 5290 
Nsii 5290 
BirBl 5290 



SraaBI 4936 



AaiH 4160 



Seal 3720 
Pvut 3609 




Bpm\ 3327 
Ahdl 3237 



651 Sdl 
651 Xtot 
709>4»l 
724 BsrGI 
BBS MsG\ 
886 iVcol 
666 Styi 
903 A47U113I 
903 Sacn 

905 A/ofl 

906 £a9l 
913 ^1 
921 damHI 
927ftel 
944>V»I 
944 Ssp 1201 
956 P5tt 

116B Na9\ 
1168 N^MIV 



Alias: pGBT9NSl 
A pplication : 2HY (bait) 
Backbone: pGBT9 
Specificity : Sfi non-oriented 
Selection : ampicilHn 
Constructed bv: CR 



2760 ^Nl 



1440 Bsmfl 

1590 Ssrxi 
1604 £co RV 
1645 an/ 361 
1747 /Mfel 
1803 Xbal 
1809 >^BI 
1809 asr API 
1929 Anil 
2171 Pwill 
2226 Sap\ 
2349 Aff III 
2349 asp LU11I 



OUgo 160 



gagagtagtaacaaaggtc AAAGACAGTTGACTGTATCGCCG GAA TTT ATG 



Sfi I 



Sacn 



Spcl 



Bam HI 



GCC ATG GCC GCA GGG GCC GCG GCC GCA CTA GTG GGG ATC C 



Ncol 



Not I 



STOP 



Sfi I 



PstI 



TT AAT 



Pad 



TAAI GGG CCA CTG GGG CCC CTC GAC CTG CAG CCA 
OUgo 161 



AGC TAA TT ccgggcgaatttcttatg 



Oligol60 5' GAGAGTAGTAACAAAGGTC 3' 
Oligo 161 5' CATAAGAAATTCGCCCGG 3' 



FIGURE 4 
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pB14 



5/16 

165 Bst m 



Ppu 101 5290 
Nsi\ 5290 
BfrBl 5290 



SnaB\ 4936 



Aafll 4160 



Seal 3720 
Pvu\ 3609 




651 Sc/I 
651 Xhol 
709 Hpa] 
724 Bsr Gl 
6B5 Msc I 
888 A/col 
688 Sfy\ 
903 M/U113I 
903 Sac II 
805 Noll 
90BBag\ 
913 Spei 
921 BamH\ 
927 Pac I 
944^31 
944 Bsp 1201 
956ftrfl 



Alias : pGBT9NS2 
A pplication : 2H Y (bailj 
Backbone : pGBT9 
Specificity : Sfi oriented 
Selection : ampiclllin 
Constructed bv : CR 



Bpml 3327 
^1 3237 



2760 AAvNI 



1168 Atoel 
1168 NgoMfS/ 



1440 Bsmn 

1590 fisf XI 
1604 Eco RV 
1645 351/361 
1747 Mfe\ 
1803 Xbal 
1609 i4paBI 
1809 BstAPi 
1929 ftn/l 
2171 Pvuil 
2226 Sfltpl 
2349 ^Ul 
2349a4|>LU11l 



Oligo 160 
gagagtagtaacaaaggtc 



AAAGACAGTTGACTGTATCGCCG GAA TTT ATG 



Sfi I 



Sacn 



Spel 



Bam HI 



G CC ATG GC C OGA CGG GCC GCG GCC GCA CTA GTG GGG ATC C 
Ncol Not I 



STOP 



Sfi I 



Apal 



PstI 



TT AAT 



TAA 



GGG CCA CTG GGG CCC CTC GAC CTG GAG CCA 



Pad 



OUgo 161 



AGC TAA TT ccgggcgaatttottatg 



OIigol60 5 GAGAGTAGTAACAAAGGTC 3' 

Oligo 161 5'CATAAGAAATTCGCCCGG3' 
^ FIGURES 
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6/16 



pB20 



161 Bstm 



Alias: pLcxlONS2 
A pplication: 2HY (bait) 
Backbone : pLex 10 (pB9) 
S pectficttv: SG-oiieaited 
Selection : ampicillin 
rnnstnictedliv: LP 



f^lOt 5451 
Nsii 5451 
BItBl 5451 
Aflf3B7/7l 5445 
Aval 5445 



Aatt\ 4321 



Seal 3881 
Pm/I 3770 




447 H/n cll 
447 Wpal 
486 6c/l 
517 Af/ul 

759 Bsmi 
957 Anel 
975 a^il 
99BXcffil 
1045fiDi>RI 
1053 SffI* 
1064fisfDSt 
1064 miM3l 
1064 Sacll 

1066 Atofr 

1067 Eagt 
1074 

1082 BamHI 
1088 P^cl 
1097 SffI * 
1105 

1105fi2p120r 
1117 Ml 

1329 iVaei 
1329 ATgoMIV 



Bpmt 3488 
Bsal 3470 
Ahd\ 3398 



/UivNI 2921 



1601 BsmFl 

1751 BsfXI 
1806 BSU36I 

1964 Xbal 
1970/^BI 
1970 asl API 
2090 Pnai 
2332 Pi^n 
23Br Sap\ 
2510ajpLU11l 



EcoRI 



Sfil 



Not I 



Spel 



GAA TTC GGG GCC GGA CGG G CC GCG G CC GCA CTA GTG 

Sacn 

STOP 



BamHI 



GGG ATC C TT AAT |rAA| G GG CCA CTG GGG CCC CTC GAG 
Pad Siil 



CTG CAG 
PstI 



FIGURE 6 
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7/16 



ppi 



Bdl 7932 
Spe\ 7917 

Xbal 7533 



377BsaHI 
435Stal 



7194 



846 0531 
/,913yUlrfl 



fpal 6552 



ttm\ 6146 
BIpl 6111 
Bael 6056 



5se8647l 5778 

fisfEII 5622 
aa\ 5506 
Ago\ 5407 




Alias :pACTnst 

A pplication : 2H Y (pny) 
Backbone : pACIlI 
Specificity: 
Selection : ampiciUin 
CDDStructed bv: 



1392AAVNI 



1926 Sapl 

2059 7Vi111t 
2157BsmBI 



acoRV4900 



BsrGI 4536 

Drain 4092 



ABSl 



2575 Bsgl 

2764 RsrH 
2B0BSaxM 
2934 MAil 
3094 61171 
3096 A/col 
3106 Smal 
3106X9181 
3111 BamHl 
3117£coRt 
3123fiBnll 
3123 fitf 13611 
3123 Sad 
3143 Sdl 
3143X1101 



cgtttggaatcactacagc GAiui i lAATACCACTACAATOQATCiATGTATATAACTATCTATT 



JC90 



Bein 

CCXiAAAAAAAGAOATCTGTATGGCTTACCCAT^^ 

Sfil Smal BamHI 

ATTACGCTAC3CTTOGOTQGTCATATGG CC ATG G AG GCC OCG OGG ATC CQA ATT 

Ncol 

SacI Xhol B^n 

CGA OCT CGA CTA OCT AGC TGA CTC GAG AGA TCT ATGAAT 



GCAAGTT 



162 



cacttcaad stgcatcgtg 



caccatctcaatttc 



ABS2 



53 



ABSl S'CGTTFGGAATCACTACAGG3* 

JC90 5' CGAT GATGAA GATACCCCACCAAA3' 

1€2 y GGGGnTTTCAGTATCTACG 3* 

ABS2 5'CACGATGCACAGTTGAAGT63' 

S3 5* GAAATT6AGATGGTGCACGATGCAC3* 



FIGURE 7 
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8/16 



pP2 



Bd\ 7947 



Xbal 754B 



377 058 HI 
43SScal 



Snam 7209 



846 0531 
913>VMfl 



6567 



Nru\ 6161 
Biipl 6126 
Bael 6071 



Ssa 86471 5793 

BstBl 5637 
C/al 5521 
^SI9l 5422 




£3^Sbm£:jiACnist 
Specificfty^Sfi non-orieoted 
Seiection- ampicillin 
Constnictiyd ^rfiu/ 



1392 AfwNI 



19ZB Sspl 



^ Tttim 
2157 BsmBt 



2575 0^1 



£coRV4924 



BsrGi 4551 

Oralll 4107 



2764 /%srn 

2905 5&rAI 
2934 Mb/f 
3096 Atol 
3100 Sff I • 
3111 MftrllSI 
3111 Sacll 
3129 0afnHI 
3135 PacI 
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